Search results for "Speech enhancement"

showing 4 items of 4 documents

A Sub-Symbolic Approach to Word Modelling for Domain Specific Speech Recognition

2006

In this work a sub-symbolic technique for automatic, data driven language models construction is presented. Such a technique can be used to arrange a language-modelling module, which can be easily integrated in existing speech recognition architectures, such as the well-found HTK architecture. The proposed technique takes advantages from both the traditional LSA approach and from a novel application of a probability space metric known as "Hellinger's distance". Experimental trials are also presented, in order to validate the proposed approach.

Computer sciencebusiness.industrySpeech recognitionMachine learningcomputer.software_genreDomain (software engineering)Speech enhancementMetric (mathematics)Artificial intelligenceLanguage modelHellinger distanceHidden Markov modelbusinesscomputerNatural languageWord (computer architecture)Seventh International Workshop on Computer Architecture for Machine Perception (CAMP'05)
researchProduct

Towards a Non-Intrusive Context-Aware Speech Quality Model

2020

Understanding how humans judge perceived speech quality while interacting through Voice over Internet Protocol (VoIP) applications in real-time is essential to build a robust and accurate speech quality prediction model. Speech quality is degraded in the presence of background noise reducing the Quality of Experience (QoE). Speech Enhancement (SE) algorithms can improve speech quality in noisy environments. The publicly available NOIZEUS speech corpus contains speech in environmental background noise babble, car, street, and train at two Signal-to-noise ratio (SNRs) 5dB and 10dB. Objective Speech Quality Metrics (OSQM) are used to monitor and measure speech quality for VoIP applications. Th…

Context modelVoice activity detectionNoise measurementComputer scienceSpeech recognitionMean opinion score020206 networking & telecommunicationsSpeech corpus02 engineering and technology01 natural sciencesBackground noiseSpeech enhancement0103 physical sciences0202 electrical engineering electronic engineering information engineeringQuality of experience010301 acoustics2020 31st Irish Signals and Systems Conference (ISSC)
researchProduct

Implicit Wiener Filtering for Speech Enhancement In Non-Stationary Noise

2021

Speech quality is degraded in the presence of background noise, which reduces the quality of experience (QoE) of the end-user and therefore motivates the usage of speech enhancement algorithms. A large number of approaches have been proposed in this context. However most of them have focused on the case where the noise is stationary, an assumption that seldom holds in practice. For instance, in mobile telephony, noise sources with a marked non-stationary spectral signature include vehicles, machines, and other speakers to name a few. On the other hand, the usage of frequency-domain information in existing algorithms for speech enhancement in non-stationary noise environments can be made mor…

Noise powerComputer scienceSpeech recognitionWiener filterSpectral densityComputer Science::Computation and Language (Computational Linguistics and Natural Language and Speech Processing)Context (language use)Background noiseSpeech enhancementNoisesymbols.namesakeComputer Science::SoundFrequency domainsymbols2021 11th International Conference on Information Science and Technology (ICIST)
researchProduct

Speech Activity Detection under Adverse Noisy Conditions at Low SNRs

2021

Speech originating from the noisy environments degrades the speech quality and intelligibility, thus reducing the human perceived Quality of Experience (QoE). For example, surveillance using drone during natural catastrophe needs an efficient speech recognition device to recognise the speech of the frozen human in presence of drone noise to save their life. Therefore, it often requires to pre-process the noisy speech in order to reduce the noise artifacts and enhance the speech. This paper detects the speech activity using Voice Activity Detection (VAD). The VAD distinguishes speech activity (speech presence) and speech inactivity (silence/noise) by extracting the speech features and compar…

Speech enhancementEuclidean distanceNoiseVoice activity detectionNoise measurementComputer scienceSpeech recognitionFeature extractionSpectral centroidIntelligibility (communication)2021 6th International Conference on Communication and Electronics Systems (ICCES)
researchProduct